Efficient Implementation of Voiced/Unvoiced Sounds Classification Based on GMM for SMV Codec
Authors
Abstract
In this letter, we propose an efficient method for improving the voiced/unvoiced (V/UV) sound decision in the 3GPP2 selectable mode vocoder (SMV) using a Gaussian mixture model (GMM). We first analyze the features and the classification method adopted in the SMV. Feature vectors for the GMM are then selected from relevant SMV parameters for efficient V/UV classification. The performance of the proposed algorithm is evaluated under various conditions, and it yields better results than the conventional SMV method.
Key words: selectable mode vocoder, Gaussian mixture model, voice activity detection
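The GMM-based decision described in the abstract can be illustrated with a minimal sketch. It assumes one diagonal-covariance GMM per class and a log-likelihood-ratio test. The two-dimensional features, the hand-set mixture parameters, and the names `gmm_log_likelihood` and `classify_vuv` are hypothetical placeholders, not the SMV parameters or the trained models used in the letter.

```python
import numpy as np


def gmm_log_likelihood(x, weights, means, variances):
    """Log-likelihood of each row of x (N, D) under a diagonal-covariance GMM.

    weights: (K,), means: (K, D), variances: (K, D).
    """
    x = np.atleast_2d(x)
    diff = x[:, None, :] - means[None, :, :]                      # (N, K, D)
    log_norm = -0.5 * np.sum(np.log(2.0 * np.pi * variances), axis=1)  # (K,)
    log_exp = -0.5 * np.sum(diff ** 2 / variances[None, :, :], axis=2)  # (N, K)
    log_comp = np.log(weights)[None, :] + log_norm[None, :] + log_exp
    # Log-sum-exp over mixture components for numerical stability.
    peak = np.max(log_comp, axis=1, keepdims=True)
    return (peak + np.log(np.sum(np.exp(log_comp - peak), axis=1, keepdims=True)))[:, 0]


def classify_vuv(x, voiced_gmm, unvoiced_gmm, threshold=0.0):
    """Return True for frames judged voiced by a log-likelihood-ratio test."""
    llr = gmm_log_likelihood(x, *voiced_gmm) - gmm_log_likelihood(x, *unvoiced_gmm)
    return llr > threshold


# Hypothetical, hand-set models: (weights, means, variances) per class.
# Assumed feature order: [periodicity measure, zero-crossing rate].
voiced_gmm = (
    np.array([0.6, 0.4]),
    np.array([[0.8, 0.1], [0.6, 0.2]]),
    np.array([[0.02, 0.01], [0.03, 0.01]]),
)
unvoiced_gmm = (
    np.array([0.5, 0.5]),
    np.array([[0.1, 0.6], [0.2, 0.5]]),
    np.array([[0.02, 0.02], [0.03, 0.02]]),
)

frames = np.array([[0.75, 0.12], [0.15, 0.55]])
decisions = classify_vuv(frames, voiced_gmm, unvoiced_gmm)
```

In practice the mixture parameters would be estimated from labeled frames (for example with expectation-maximization), and the threshold would be tuned for the desired trade-off between the two error types.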
Similar resources
A Variable Rate Speech Codec Using Vus Classification
Voiced speech is highly correlated and must be reconstructed accurately in order to sound correct. Unvoiced speech, on the other hand, is noise-like in nature. It can be approximated by white noise coloured by the vocal tract filter. Because of this lack of structure in unvoiced speech sounds, the excitation signal does not have to reproduce the speech signal as accurately as for voiced sounds. T...
Acoustic Environment Classification Based on SMV Speech Codec Parameters for Context-Aware Mobile Phone
In this letter, an acoustic environment classification algorithm based on the 3GPP2 selectable mode vocoder (SMV) is proposed for context-aware mobile phones. Classification of the acoustic environment is performed based on a Gaussian mixture model (GMM) using coding parameters of the SMV extracted directly from the encoding process of the acoustic input data in the mobile phone. Experimental r...
Voiced/Unvoiced and Silent Classification Using HMM Classifier based on Wavelet Packets BTE features
Wavelet Packets Best Tree Encoded (BTE) features are used here as base features for an HMM classifier. The research aims to introduce the newly designed features that are discussed in [1]. The problem considered is voiced, unvoiced, and silent classification. A comparison with the 19 filter bank features is provided. Although it is simple and straightforward, BTE gives results comparable to the 19 el...
Wavelet-Based Speech Enhancement Using Time-Frequency Adaptation
Recommended by Satya Dharanipragada. Wavelet denoising is commonly used for speech enhancement because of the simplicity of its implementation. However, the conventional methods produce musical residual noise while thresholding the background noise. The unvoiced components of speech are often eliminated by this method. In this paper, a novel algorithm of wavelet coefficient th...
A CELP variable rate speech codec with low average rate
This paper presents a variable-rate CELP codec which achieves good communications speech quality at an average rate of about 3 kb/s. The codec operates as a source-controlled variable rate coder with rates of 4.9 kb/s for voiced and transition sounds, 3.0 kb/s for unvoiced sounds and 670 b/s for silent frames. New techniques used in the codec include prediction of the fixed codebook target vector...
Journal title:
- IEICE Transactions
Volume 92-A, Issue
Pages -
Publication date: 2009